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(57) Abstract 

A method for searching for a second character element string from a first character element string obtained by recognizing a character 
string. The first character element string includes a first character element, and the second character element string includes a second 
character element. A predetermined distance between the first and second character elements is set. The distance relates to the similarity 
between the first and second character elements. The searching method comprises the step of comparing the distance with a predetermined 
reference distance and the step of judging whether or not the first character element agrees with the second element, based on the results of 
the comparison. 
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AMENDMENTS 

(Amendment under Seat ion 11 of the Japanese Law Concerning 
International Applications, Etc. Pursuant to the Patent 
Cooperation Treaty) 



To the Commissioner of the Japanese Patent Office 

1, Identification of the International Application 
PCT/JP99/07050 



2. Applicant 

Name MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD, 

Address 1006, oaza Kadoma, 

Kadoma-shl, Osaka 571-8501 Japan 
Country of nationality Japan 
Country of residenoeJapan 

3 • Agent 

Name (7828) Shusaku YAMAMOTO 

Address Fifteenth Floor, Crystal Tower, 2-27, 

Shiromi 1-chome, Chuo-ku. 

Osaka-shi, Osaka S40-6015 Japan 



4. Item to be Amended 
Claims 



5. Content of Amendments 

As described in the annexed document, 

(1) "a plurality of character elements* is amended to 
"character elements at a plurality of locations", and "the 
plurality of character elements" is amended to *the 
character elements at a plurality of locations* (lines 5 and 
7 in claim 8) [corresponding to lines 8 and 14 in claim 8 of 
the English version]. 
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(2) "a plurality of aharaoter elements" is amended to 
"character elements at a plurality of locations' , and "the 
plurality of character elements * is amended to "the 
aharaoter elements at a plurality of locations " (lines 5 and 

7 in claim 17) [corresponding to lines 8 and 14 In claim 17 
of the English version] , 

(3) "a plurality of character elements" is amended to 
"character elements at a plurality of locations", and "the 
plurality of character elements" is amended to "the 
character elements at a plurality of locations" (lines 6 and 

8 in claim 20) [corresponding to lines 9 and 15 in claim 20 
of the English version], 

6. List of Attached Documents 

New sheets for "CLAIMS" on pages 51, 54, 55, and 56 
[corresponding to pages 78, 79, 82, 83, 84, and 85 of the 
English language specification] 

one copy of each 
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- 78 - P21873 



changing the first predetermined reference 
distance to a second reference distance; 

comparing the distance with the second reference 
distance) and 

5 determining whether the second character element 

matches the first character element based on a result of 
the comparison of the distance with the second reference 
distance* 

10 5. A retrieval method according to claim 1, wherein a 
plurality of distances relevant to the similarity between 
the first character element and the second character element 
are predetermined between the first character element and 
the second character element, and 

15 one distance selected from the plurality of 

distances is used as the distance, 

6. A retrieval method according to claim 5, wherein the one 
of the plurality of distances is determined based on user 

20 input. 

7, A retrieval method according to claim 1, wherein the 
distance has a probabilistic distribution. 

* 

25 8. A retrieval method for searching ». first character 
element string obtained by subjecting a character string 
to character recognition for a second character element 
string, 

wherein the first character element string includes 
30 a plurality o£ character elements, 

for a specific character element of the plurality 
of character elements, a plurality of character elements 
having the possibility of being concatenated with the 
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specif ic character element are predetermined, 

the retrieval method comprising the steps of: 
determining whether a character element string 
obtained by concatenating the specific character element 
5 of the plurality of character elements with one character 
element of the plurality of character elements, the one 
character element being different from the specific 
character element, matches at least a part of the second 
character element string* 

10 

9. A retrieval method according to claim 8, comprising the 
steps of s 

selecting one character element from the plurality 
of character elements having the possibility of being 
15 concatenated with the specific character element; and 

determining whether a character element string 
obtained by concatenating the specific character element 
with the selected character element matches at least a part 
of the second character element string. 

20 

10* A retrieval method according to claim 8, wherein the 
specific character element is located at an end of a row 
or column, the plurality of character elements having the 
possibility of being concatenated with the specific 
25 character element are each located at a head of a row or 
column . 

11. A retrieval method according to claim 8, wherein the 
specific character element and one of the plurality of 
30 character elements having the possibility of being 
concatenated with the specific character element are 
located at the same row or column, and 

the specific character element and another one of 
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the retrieval device comprising: 

means for comparing the distance with a 
predetermined reference distance; and 

means for determining whether the seoond character 
element matches the first character element based on a result 
of the comparison of the distance with the predetermined 
reference distance. 

17. A retrieval device for searching a first character 
element string obtained by subjecting a character string 
to character recognition for a second character element 
string, 

wherein the first character element string includes 
a plurality of character elements, and 

for a specific character element of the plurality 
of character elements, a plurality of character elements 
having th« possibility of being concatenated with the 
specific character element are predetermined, 

the retrieval device comprising i 

means for determining whether a character element 
string obtained by concatenating the specific character 
element of the plurality of character elements with one 
character element of the plurality of character elements , 
the one character element being different from the specific 
character element, matches at least a part of the second 
character element string. 

IB- A retrieval device for searching a first character 
element string obtained by subjecting a character string 
to character recognition for a second character element 
string, 

wherein the first character element string includes 
at least one first character element and the second character 
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element string Includes at least one second character 
element , 

the retrieval device comprising : 

means for obtaining a probability that a search 
result matches the second character element string , based 
on the number of the second character elements included in 
the second oharacter element string, and a number of the 
second character elements matching the corresponding first 
character elements out of the second character elements 
included in the second character element string; and 

means for determining the correctness of the search 
result based on the probability. 

19 - A computer readable recording medium in which a program 
for causing a computer to execute a retrieval process for 
searching a first character element string obtained by 
subjecting a character string to character recognition for 
a second character element string is recorded, and 

wherein the first character element string includes 
a first oharacter element and the second character element 
string includes a second oharacter element, 

a distance relevant to a similarity between the 
first oharacter element and the second oharacter element 
is predetermined between the first character element and 
the second character element, 

the retrieval process comprising the steps oft 

comparing the distance with a predetermined 
reference distance j and 

determining whether the second character element 
matches the first character element based on a result of 
the comparison of the distance with the predetermined 
reference distance* 



i- 
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20 . A computer readable recording medium in which a program 
for causing a computer to execute a retrieval process for 
searching a first character element string obtained by 
subjecting a character string to character recognition for 
a second character element string is recorded, 

wherein the first character element string Includes 
a plurality of character elements, and 

for a specific character element of the plurality 
of character elements, a plurality of oharaoter elements 
having the possibility of being concatenated with the 
specific character element are predetermined, 

the retrieval process comprising the steps of s 

determining whether a character element string 
obtained by concatenating the specific character element 
of the plurality of character elements with one character 
element of the plurality of character elements, the one 
character element being different from the specific 
character element, matches at least a part of the second 
character element string. 

21. A computer readable recording medium In which a program 
for causing a computer to execute a retrieval process for 
searching a first oharaoter element string obtained by 
subjecting a character string to character recognition for 
a second character element string is recorded, 

wherein the first character element string Includes 
at least one first character element and the second character 
element string includes at least one second character 
element , 

the retrieval process comprising the steps of: 
obtaining a probability that a search result matches 
the second character element string, based on the number 
of the second character elements Included in the second 
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character element string, and a number of the second 
character elements matching the corresponding first 
character elements out of the second character -elements 
included in the second character element stringy and 

determining the correctness of the search result 
based on the probability. 
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To Examiner of the Japanese Patent Office 

1 . Identification of the International Application 
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Country of nationality Japan 
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Shiromi 1-ohome, Chuo-ku, 
Osaka-ehi, Osaka 540-6015 Japan 

4* Mailing Date January 8, 2000 

5. Content of Arguments 

CD Regarding "claims 1, 16, and 19 lack novelty in 
view of References 1 and 2" 

In References 1 and 2, a probability that a specific 
character is recognised as the same character or other 
characters is held in a table (see Reference 1, Figure 1, 
page 627, and Reference 2, Figure 2 # page 65), a score is 
calculated with reference to the probability in the table, 
and the score is compared with a threshold (see Reference 1, 
page 626, left paragraph, line 28). 

In the present invention, the distance defined 
between character elements is obtained based on the shapes 
of character elements. Therefore, the distance is different 
from a probability (the present specification, page 13, 

1 



3, Agent 

Name 
Address 



THIS PAGE BlANKluspio) 



line 4) [corresponding to page 20, line 17 of the English 
version] . 

It is difficult to obtain the probability of an event 
which will unlikely occur based on the probability 
distributions in References 1 and 2. Moreover , the value of 
such a probability varies to a large extent depending on 
documents used for statistical samples* The distance 
between character elements of the present invention is easy 
to calculate by comparison of the shapes of the character 
elements (e.g., an input -output relationship of a character 
recognition system or a positional relationship in a feature 
quantity space (the present specification, page 13, line 4)) 
[corresponding to page 20 , line 17 in the English version]. 
Therefore, a large number of statistical samples are not 
required, and a relationship between character elements can 
be obtained independent of such statistical samples. 

There is no description regarding a distance between 
character elements in References 1 and 2» Therefore, the 
Applicant argues that the present Invention has novelty and 
Inventive step, 

(2) Regarding "claim 2 lacks novelty in view of 
References 1 and 2" 

In References 1 and 2, a probability that a specific 
character is recognized as the same character or other 
characters is held in a table (see Reference 1, Figure 1, 
page 627, and Reference 2, Figure 2, page 65) , and a 
plurality of candidates having a high probability are 
produced with reference to a posterior probability that a 
character as a result of a recognition result is the same 
character or other characters from the table (Reference 1, 
page 626, left paragraph, line 2B). As a result, the same 
set of candidates are produced for the same recognition 
result . 
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In the present invention, apart from the distance 
between oharaater elements defined in advance, a reliability 
of a recognition result of each character element, is 
provided. A reference distance is determined using the 
reliability. If the same recognition result has a different 
reliability, the tolerable level to recognition error is 
changed. As a result, a desirable search can be performed 
for an incorrect recognition result, and detection of extra 
character element s tring & can be suppre s s ed ( the pr es en t 
specification, page 16, line 26 to page 17, line 11) 
[corresponding to page 26, line 27 to page 27, line 4 of the 
English version]. 

In References 1 and 2, there is no description of the 
reliability given to a recognition result of an individual 
character element. Therefore, the present invention has 
novelty and inventive step. 

(3) Regarding claims 3 through 7 

Claims 3 through 7 are dependent from claim 1 and 
therefore have novelty and inventive step. 

(4) Regarding * claims 8, 17, and 20 lack novelty in 
view of Reference 4" 

Claims 8, 17, and 20 are amended in the Amendments 
submitted on the same date as this Reply. 

In the Amendments, the phrase "a plurality of 
character elements having the possibility of being 
concatenated with a specific character element" is amended 
to the phrase "character elements at a plurality of 
locations having the possibility of being concatenated with 
a specific character element" in order to emphasize what was 
originally intended by the inventors. 
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Such an amendment clarifies that oharaoter elements 
existing at a plurality of (different) candidate locations 
(not fixed) which follow a specific character element 
(location)* In conventional methods, as indicated in 
Reference 4, "a plurality of candidate characters for a 
recognition result of a character following a specific 
character element" are used. 

There is no description regarding a retrieval method 
using * character elements at a plurality of locations having 
the possibility of being concatenated with a specific 
character element" in Reference 4. Therefore, the present 
invention has novelty and inventive step. 

The retrieval method using "oharaoter elements at a 
plurality of locations having the possibility of being 
concatenated with a specific character element" is described 
in the specification of the present application, e^g., at 
page 30, lines 2-8 [corresponding to page 47 , lines 5-18 of 
the English version] (an ending character element and 
leading character elements of a plurality of other 
paragraphs), at page 34, lines 8-15 [corresponding to 
page 53, line 24 to page 54, line 6 of the English version] 
(Figure 20: a recognition result is produced using character 
elements at a plurality of concatenating^ locations ) , and at 
page 35, lines 13-18 [corresponding to page 55, line 29 to 
page 56, line 8 of the English version] (Figure 22A and 22By 
a recognition result is produced under an assumption of 
vertical and horizontal concatenation). As described above, 
a search is performed using character elements at a 
plurality of locations having the possibility of being 
concatenated with a specif la character element. Therefore, 
even when concatenation is incorrectly determined in 
character recognition, a desirable result can be obtained in 
a character element string search. 
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(5) Regarding claim 9 

Claim 9 is dependent from claim 8, and therefore has 
novelty • 

(6) Regarding claims 10 and 11 

Claims 10 and 11 are dependent from claim 8, and 
therefore have inventive step. 

search error la often caused by recognition error in 
character recognition. Therefore, it is a conventional 
objective to realize desirable search using a recognition 
result . 

In general , a concatenation relationship between 
character elements is determined using a recognition result, 
independent of character recognition. 

The present invention is significantly different from 
a conventional method for obtaining accurate character 
concatenation. The inventors focused their attention to a 
novel challenge to achieve a desired search, and made an 
attempt to improve search precision by a new approach in 
which candidates for a concatenation relationship, which 
have not been utilized prior to the present invention, are 
used after character recognition. 

References 5 and 6 focus attention on that a correct 
character is selected from a plurality of candidates to be 
concatenated as accurately as possible. In this case, after 
the selection, information about the candidates cannot be 
utilized. In the present invention, such information about 
candidates which is not used after character recognition is 
newly utilized, thereby obtaining a desirable result in 
searching even if error occurs in concatenation. 
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(7) Regarding "claims 12, 13, 18 , and 21 lack novelty 
in view of Reference 1" 

In Reference 1, "a probability that a character 
string obtained by searching a text as a result of 
recognition is correct" is defined as 

(a) a product of probabilities that each character in 
a character string is correct, or if matches do not occur in 
all characters, 

(b) a partial certainty obtained by comparing between 
the certainty of each character element in a character 
string and multiplying the character elements except for a 
character element having the smallest certainty. 

The method (b) in Reference 1 can be used only when a 
character is a wildcard, and the certainties of character 
elements except for a character element having the smallest 
certainty are only multiplied irrespective of the number of 
character elements in a character string to be matched. 
Ideally, the certainty of a character string as a whole 
should be higher when matches occur in nine out of ten 
characters in a character string (one character wildcard) 
than when a match occurs in one out of two characters in a 
character string (one character wildcard). This point is 
ignored in Reference 1. The product o£ certainties of 
characters excluding a character having the smallest 
certainty is only obtained in Reference 1. 

In the inventions of claims 12, 13, 18, and 21, a 
novel probability is introduced based on the number of 
characters (character elements) in a search character string 
and the number of matched characters (character elements). 
Specifically, when a match occurs in one of the two 
characters in a character string, a probability that a 
character string as a result of recognition is a search 
character string is low, while when matches occur in nine 
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out of the ten characters In a character string, a 
probability that a character string as a result of 
recognition is a search character string is high. This fact 
is utilized in the present invention. Moreover, the number 
of characters used as wildcards is not limited to one. 

Therefore, the Applicant argues that claims 12, 13, 
18, and 21 have novelty over Reference 1. 

(8) Regarding "claim 14 laaks inventive step in view 
of References 1 and 7" 

In Reference 7, when a search result is not obtained, 
a condition is relaxed in such a manner that the number of 
characters used as wildcards is increased. However, if the 
number of characters used as wildcards is increased, a 
number of other word having only several different 
characters are detected as search results. That is, a 
number of other words are likely to be incorrectly retrieved. 

in the invention of claim 14, a reference distance in 
a character element distance table Is set to a large value. 
This means that the number of characters to be subjected to 
comparison is increased by narrowing a search target to 
similar characters which tend to be mlsrecognized. 
Therefore, it is less possible that extra words are 
incorrectly retrieved as compared to Reference 7. 

Therefore, claim 14 has inventive step over 
References 1 and 7. 

(9) Regarding * claim 15 lacks inventive step over 

References 1 and 8* 

In Reference 8, a search -requested character string 
Is subjected to morphogioal analysis to be divided into 
words. A search is performed for the divided words. 



7 



THIS PAGE BLANK lusno) 




Therefore, it is possible to search character strings having 
the same meaning but different expression. 

The invention of claim 15 is the same as the subject 
matter of Reference 8 in the point that a search-requested 
character string is subjected to morphoglcal analysis to be 
divided into words. However, in the invention of alaim 15, 
when a search is performed using a probability based on the 
number of characters (character elements) in a search 
character string and the number of matched characters 
(character elements), it is possible to prevent detection of 
words which are not a search target due to compound words or 
the like (see Example 14)* 

Therefore, alalm 15 has inventive step over 
References 1 and 8. 
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